AITopics | introspective distillation

Introspective Distillation for Robust Question Answering

Neural Information Processing Systems, Dec-24-2025, 10:17:07 GMT

Question answering (QA) models are well known to exploit data bias, e.g., the language prior in visual QA and the position bias in reading comprehension. Recent debiasing methods achieve good out-of-distribution (OOD) generalizability at a considerable cost to in-distribution (ID) performance. Therefore, they are only applicable in domains where the test distribution is known in advance. In this paper, we present a novel debiasing method called Introspective Distillation (IntroD) to make the best of both worlds for QA. Our key technical contribution is to blend the inductive bias of OOD and ID by introspecting whether a training sample fits in the factual ID world or the counterfactual OOD one. Experiments on the visual QA datasets VQA v2 and VQA-CP and the reading comprehension dataset SQuAD demonstrate that our proposed IntroD maintains competitive OOD performance compared to other debiasing methods, while sacrificing little or even achieving better ID performance compared to the non-debiasing ones.
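The blending idea in the abstract can be illustrated with a small sketch. The listing does not give the exact formulas, so everything below is an assumption made for illustration: the function names are hypothetical, the weighting rule (a teacher gets less weight on samples it already fits well) is one plausible reading of "introspecting", and blending the two teachers' output distributions directly is a simplification rather than the authors' stated procedure.

```python
import numpy as np

def cross_entropy(probs, target, eps=1e-12):
    """Per-sample cross-entropy of predicted probabilities against a target distribution."""
    return -np.sum(target * np.log(probs + eps), axis=-1)

def introspective_blend(p_id, p_ood, target, eps=1e-12):
    """Weight two teachers per sample and blend their predictions.

    Assumed rule: a sample that the ID teacher fits well (low ID loss) is
    treated as ID-biased, so the OOD teacher gets the larger weight, and
    vice versa.
    """
    xe_id = cross_entropy(p_id, target)
    xe_ood = cross_entropy(p_ood, target)
    w_id = xe_id / (xe_id + xe_ood + eps)
    w_ood = 1.0 - w_id
    blended = w_id[:, None] * p_id + w_ood[:, None] * p_ood
    return w_id, w_ood, blended

def distillation_loss(p_student, blended):
    """Mean cross-entropy of the student's predictions against the blended target."""
    return float(np.mean(cross_entropy(p_student, blended)))

# Toy batch: two samples, two candidate answers, answer 0 is correct for both.
target = np.array([[1.0, 0.0], [1.0, 0.0]])
p_id = np.array([[0.9, 0.1], [0.2, 0.8]])    # ID teacher fits sample 0, misses sample 1
p_ood = np.array([[0.5, 0.5], [0.6, 0.4]])   # OOD teacher is less confident overall
w_id, w_ood, blended = introspective_blend(p_id, p_ood, target)
loss = distillation_loss(np.array([[0.7, 0.3], [0.5, 0.5]]), blended)
```

In this toy batch, sample 0 is fit well by the ID teacher, so the sketch leans on the OOD teacher for it, while sample 1 leans on the ID teacher. The student is then trained against the blended distribution instead of either teacher alone.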

electronic proceedings, introspective distillation, name change, (1 more...)


Technology: Information Technology > Artificial Intelligence (0.45)


Appendix for "Introspective Distillation for Robust Question Answering": A. Causal QA Model (Figure A1: Causal graph for QA)

Neural Information Processing Systems, Aug-15-2025, 16:39:30 GMT

Figure A1 shows the causal graph for QA. We use indirect effects as the predictions of the OOD teachers. All datasets used are openly available for research use. We train the teacher model following the released source code. For the student model, we implement the same VQA main branch, the baseline model UpDn.
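To make "indirect effects as predictions" concrete, here is a minimal sketch based on the standard causal mediation definition, where the total indirect effect is the factual outcome minus the outcome with the mediator set to a reference value. The excerpt does not specify the model, so the log-sigmoid fusion, the zero-valued reference, and all names below are assumptions chosen only for illustration.

```python
import numpy as np

def fuse(q_logits, k_logits):
    """Hypothetical fusion of a question-only branch and a multimodal branch:
    log-sigmoid of the summed logits (computed stably via logaddexp)."""
    return -np.logaddexp(0.0, -(q_logits + k_logits))

def total_indirect_effect(q_logits, k_logits, k_reference):
    """TIE = Y(q, k) - Y(q, k*): the factual score minus the score obtained
    when the multimodal branch is replaced by a reference value k*."""
    return fuse(q_logits, k_logits) - fuse(q_logits, k_reference)

# Toy example with two candidate answers.
q_logits = np.array([2.0, 0.0])   # language prior favours answer 0
k_logits = np.array([0.0, 2.0])   # multimodal evidence favours answer 1
k_reference = np.zeros(2)         # assumed "no evidence" reference

tie = total_indirect_effect(q_logits, k_logits, k_reference)
prior_choice = int(np.argmax(q_logits))
debiased_choice = int(np.argmax(tie))
```

Here the question-only branch alone would pick answer 0, while ranking answers by the indirect effect picks answer 1, because the contribution that flows only through the language prior cancels out.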

machine learning, question answering, teacher model, (17 more...)


Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.44)


Introspective Distillation for Robust Question Answering

Neural Information Processing Systems, Jan-13-2025, 20:18:37 GMT


introspective distillation


Technology: Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.66)


Introspective Distillation for Robust Question Answering

Niu, Yulei, Zhang, Hanwang

arXiv.org Artificial Intelligence, Nov-1-2021

Question answering (QA) models are well known to exploit data bias, e.g., the language prior in visual QA and the position bias in reading comprehension. Recent debiasing methods achieve good out-of-distribution (OOD) generalizability at a considerable cost to in-distribution (ID) performance. Therefore, they are only applicable in domains where the test distribution is known in advance. In this paper, we present a novel debiasing method called Introspective Distillation (IntroD) to make the best of both worlds for QA. Our key technical contribution is to blend the inductive bias of OOD and ID by introspecting whether a training sample fits in the factual ID world or the counterfactual OOD one. Experiments on the visual QA datasets VQA v2 and VQA-CP and the reading comprehension dataset SQuAD demonstrate that our proposed IntroD maintains competitive OOD performance compared to other debiasing methods, while sacrificing little or even achieving better ID performance compared to the non-debiasing ones.

inductive bias, ood performance, ood-teacher, (14 more...)


2111.01026

Genre: Research Report (0.64)

Industry: Education (0.70)
